AMENDMENTS TO THE CLAIMS 

Please amend claims 30-32 as indicated below. Please add new claims 33-37. 

1-9. (Cancelled)



10. (Previously Added) A processor comprising:

a memory to store a first packed data operand having a first plurality of data elements and
a second packed data operand having a second plurality of data elements;

a partial-width packed data instruction to indicate the first packed data operand and the
second packed data operand and to indicate a first operation to be performed on a subset
of corresponding pairs of data elements of the first and the second packed data operands;

a decoder coupled with the memory to receive the partial-width packed data instruction
and to decode the partial-width packed data instruction; and

a partial-width execution unit coupled with the decoder to execute the operation on the
subset of corresponding pairs of data elements.

11. (Previously Added) The processor of claim 10:

wherein the decoder is a decoder to convert the partial-width packed data instruction into
a first micro instruction that corresponds to a first subset of at least one corresponding
pair of data elements of the first and the second packed data operands and a second micro
instruction that corresponds to a second subset of at least one corresponding pair of data
elements of the first and the second packed data operands; and

wherein the partial-width execution unit is a partial-width execution unit to execute an
operation specified by the first micro instruction on the first subset.

Attorney Docket No.: 42P5193C
Application No.: 09/852,217

12. (Previously Added) The processor of claim 11, further comprising a port to receive at
least one data element of the first subset and to not receive a data element of the second
subset.



13. (Previously Added) The processor of claim 11:

wherein the processor is a processor to eliminate the second micro instruction; and 

wherein the processor is a processor to set at least one result data element corresponding 
to the second subset to a predetermined value. 

14. (Previously Added) The processor of claim 11:

further comprising delay circuitry to delay execution of operations on the second subset;
and

wherein the partial-width execution unit is a partial-width execution unit to execute an
operation specified by the second partial-width micro instruction on the second subset
after the delay.

15. (Previously Added) The processor of claim 10:

further comprising a first port coupled with the memory to receive the first packed data
operand and a second port coupled with the memory to substantially simultaneously
receive the second packed data operand;

further comprising divide circuitry to divide the first packed data operand into a first
subset comprising at least one data element and a second subset comprising at least one
data element and to divide the second packed data operand into a third subset comprising
at least one data element and a fourth subset comprising at least one data element; and

wherein the partial-width execution unit is a partial-width execution unit to perform the
first operation on at least one corresponding pair of data elements of the first and the third
subsets to generate at least one resulting data element.



16. (Previously Added) The processor of claim 15:

further comprising delay circuitry to delay the second subset and to delay the fourth
subset; and

wherein after the delay, the partial-width execution unit is a partial-width execution unit
to perform the first operation on at least one corresponding pair of data elements of the
second and the fourth subsets to generate at least one additional resulting data element.

17. (Previously Added) The processor of claim 15, wherein the partial-width execution unit
is a partial-width execution unit to generate at least one additional resulting data element
corresponding to the second and the fourth subsets by setting the at least one additional
resulting data element to a predetermined value.

18. (Previously Added) The processor of claim 15, wherein the partial-width execution unit
is a partial-width execution unit to execute the packed data instruction and a second
similar packed data instruction on a half clock cycle.

19. (Previously Added) The processor of claim 15, wherein the partial-width execution unit
is a 64-bit partial-width execution unit, and wherein the first and the second packed data
operands are 128-bit operands.



20. (Previously Added) A method comprising:

receiving a packed data instruction that specifies memory locations of a first full-width
packed data operand having a plurality of data elements and a second full-width packed
data operand having a corresponding plurality of data elements;

substantially simultaneously accessing the first full-width packed data operand and the
second full-width packed data operand from the memory locations;

dividing the first full-width packed data operand into a first subset of data elements and a
second subset of data elements and dividing the second full-width packed data operand
into a third subset of data elements and a fourth subset of data elements;

performing an operation specified by the packed data instruction on the first and third
subsets of data elements to generate a first resulting one or more data elements;

delaying the second and fourth subsets of data elements;

after said delaying, performing the operation specified by the packed data instruction on
the second and the fourth subsets of data elements to generate a second resulting one or
more data elements; and

storing the first and the second resulting data elements in a common packed data operand.



21. (Previously Added) The method of claim 20, wherein performing an operation
specified by the macro instruction on the second and fourth subsets comprises setting a
data element to a predetermined value.

22. (Previously Added) The method of claim 20, wherein dividing includes dividing a 128-bit
packed data operand into a 64-bit segment of two low order data elements and a 64-bit
segment of two high order data elements.

23. (Previously Added) A processor comprising:

a packed data instruction to specify an operation on a plurality of data elements of at least
one packed data operand;

a decoder to generate a first micro instruction and a second micro instruction
corresponding to the packed data instruction, the first micro instruction specifying a first
operation and the second micro instruction specifying a second operation;

an execution unit to execute an operation specified by the first micro instruction on only a
subset of the plurality of packed data elements; and

circuitry to eliminate the second micro instruction.



24. (Previously Added) The processor of claim 23, wherein the decoder is a decoder to
create the second micro instruction by replicating the first micro instruction to create a
replica and modifying the replica to create the second micro instruction.



25. (Previously Added) The processor of claim 23, wherein the execution unit is an
execution unit to set a data element in a result packed data operand to a predetermined
value.



26. (Previously Added) A method comprising:

receiving a packed data instruction specifying memory locations of a first packed data
operand and a second packed data operand;

converting the packed data instruction into a first packed data micro instruction and a
second packed data micro instruction;

executing the first packed data micro instruction including accessing only a subset of data
elements of the first and the second packed data operands comprising at least one pair of
corresponding data elements from the first and the second packed data operands and
causing an operation specified by the packed data instruction to be performed on the
subset to produce a resulting one or more data elements; and

executing the second packed data micro instruction including accessing only a subset of
data elements of the first and the second packed data operands comprising at least one
pair of corresponding data elements from the first and the second packed data operands
and causing an operation specified by the packed data instruction to be separately
performed on the subset to produce a resulting one or more additional data elements.



27. (Previously Added) The method of claim 26, wherein executing the second packed data
micro instruction includes setting a data element of the one or more additional data
elements to a predetermined value.



28. (Previously Added) The method of claim 26, further comprising:

writing the resulting one or more data elements to a result packed data operand; and

writing the resulting one or more additional data elements to the same result packed data
operand.

29. (Previously Added) The method of claim 26, wherein executing the second packed data
micro instruction is delayed relative to executing the first packed data micro instruction.

30. (Currently Amended) A processor comprising:

a memory to store a first packed data operand and a second packed data operand;

an instruction to indicate the first packed data operand and the second packed data
operand and to indicate an operation to be performed on [[a]] the first packed data operand
and the second packed data operand;

decoder means for decoding [[to decode]] the instruction; and

execution means for executing [[to execute]] the instruction.

31. (Currently Amended) The processor of claim 30, wherein the decoder means is a decoder
means for decoding [[to decode]] the instruction into a first micro instruction that specifies
an operation on only a portion of the first and the second packed data operands and a
second micro instruction that specifies an operation on only a different portion of the first
and the second packed data operands.

32. (Currently Amended) The processor of claim 30, wherein the execution means is an
execution means for performing [[to perform]] operations specified by the instruction on a
first subset of corresponding pairs of data elements of the first and the second packed
data operands and after a delay to perform operations specified by the instruction means
on a second subset of corresponding pairs of data elements of the first and the second
packed data operands.

33. (New) A computer system comprising:

a bus;

a storage device including a flash memory coupled to the bus to store data;

a processor coupled to the storage device by the bus to execute instructions;

a memory of the processor to store a first packed data operand having a first plurality of
data elements and a second packed data operand having a second plurality of data
elements;

a decoder of the processor coupled with the memory of the processor, the decoder to
receive a partial-width packed data instruction and to decode the partial-width packed
data instruction, wherein the partial-width packed data instruction indicates the first
packed data operand and the second packed data operand, and indicates a first operation
to be performed on a subset of corresponding pairs of data elements of the first and the
second packed data operands; and

a partial-width execution unit of the processor coupled with the decoder to execute the
operation on the subset of corresponding pairs of data elements.

34. (New) The computer system of claim 33:

wherein the decoder is a decoder to convert the partial-width packed data instruction into
a first micro instruction that corresponds to a first subset of at least one corresponding
pair of data elements of the first and the second packed data operands and a second micro
instruction that corresponds to a second subset of at least one corresponding pair of data
elements of the first and the second packed data operands; and

wherein the partial-width execution unit is a partial-width execution unit to execute an
operation specified by the first micro instruction on the first subset.

35. (New) The computer system of claim 34:

wherein the processor is a processor to eliminate the second micro instruction; and

wherein the processor is a processor to set at least one result data element corresponding
to the second subset to a predetermined value.

36. (New) The computer system of claim 33:

further comprising a first port coupled with the memory to receive the first packed data
operand and a second port coupled with the memory to substantially simultaneously
receive the second packed data operand;

further comprising divide circuitry to divide the first packed data operand into a first
subset comprising at least one data element and a second subset comprising at least one
data element and to divide the second packed data operand into a third subset comprising
at least one data element and a fourth subset comprising at least one data element; and

wherein the partial-width execution unit is a partial-width execution unit to perform the
first operation on at least one corresponding pair of data elements of the first and the third
subsets to generate at least one resulting data element.

37. (New) The computer system of claim 36:

further comprising delay circuitry to delay the second subset and to delay the fourth
subset; and

wherein after the delay, the partial-width execution unit is a partial-width execution unit
to perform the first operation on at least one corresponding pair of data elements of the
second and the fourth subsets to generate at least one additional resulting data element.